AITopics | Zhuhai

Machine learning models in high-stakes applications, such as recidivism prediction and automated personnel selection, often exhibit systematic performance disparities across sensitive subpopulations, raising critical concerns regarding algorithmic bias. Fairness auditing addresses these risks through two primary functions: certification, which verifies adherence to fairness constraints; and flagging, which isolates specific demographic groups experiencing disparate treatment. However, existing auditing techniques are frequently limited by restrictive distributional assumptions or prohibitive computational overhead. We propose a novel empirical likelihood-based (EL) framework that constructs robust statistical measures for model performance disparities. Unlike traditional methods, our approach is non-parametric; the proposed disparity statistics follow asymptotically chi-square or mixed chi-square distributions, ensuring valid inference without assuming underlying data distributions. This framework uses a constrained optimization profile that admits stable numerical solutions, facilitating both large-scale certification and efficient subpopulation discovery. Empirically, the EL methods outperform bootstrap-based approaches, yielding coverage rates closer to nominal levels while reducing computational latency by several orders of magnitude. We demonstrate the practical utility of this framework on the COMPAS dataset, where it successfully flags intersectional biases, specifically identifying a significantly higher positive prediction rate for African-American males under 25 and a systemic under-prediction for Caucasian females relative to the population mean.

artificial intelligence, disparity, machine learning, (19 more...)

arXiv.org Machine Learning

2601.20269

Country:

North America > United States > New York > New York County > New York City (0.04)
Asia > China > Fujian Province > Xiamen (0.04)
North America > United States > California > Santa Clara County > Palo Alto (0.04)
(4 more...)

Genre:

Research Report > Experimental Study (1.00)
Research Report > New Finding (0.92)

Industry: Information Technology > Security & Privacy (0.67)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.94)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.67)

Add feedback

Sparse Convex Biclustering

Jiang, Jiakun, Xiang, Dewei, Gu, Chenliang, Liu, Wei, Wang, Binhuan

arXiv.org Machine LearningJan-6-2026

Biclustering is an essential unsupervised machine learning technique for simultaneously clustering rows and columns of a data matrix, with widespread applications in genomics, transcriptomics, and other high-dimensional omics data. Despite its importance, existing biclustering methods struggle to meet the demands of modern large-scale datasets. The challenges stem from the accumulation of noise in high-dimensional features, the limitations of non-convex optimization formulations, and the computational complexity of identifying meaningful biclusters. These issues often result in reduced accuracy and stability as the size of the dataset increases. To overcome these challenges, we propose Sparse Convex Biclustering (SpaCoBi), a novel method that penalizes noise during the biclustering process to improve both accuracy and robustness. By adopting a convex optimization framework and introducing a stability-based tuning criterion, SpaCoBi achieves an optimal balance between cluster fidelity and sparsity. Comprehensive numerical studies, including simulations and an application to mouse olfactory bulb data, demonstrate that SpaCoBi significantly outperforms state-of-the-art methods in accuracy. These results highlight SpaCoBi as a robust and efficient solution for biclustering in high-dimensional and large-scale datasets.

algorithm, artificial intelligence, machine learning, (15 more...)

arXiv.org Machine Learning

2601.01757

Country:

North America > United States (0.14)
Asia > China > Beijing > Beijing (0.04)
Asia > China > Sichuan Province > Chengdu (0.04)
Asia > China > Guangdong Province > Zhuhai (0.04)

Genre: Research Report > Promising Solution (0.54)

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.71)

Add feedback

China launches massive aerial drone carrier in show of prowess

The Japan TimesDec-11-2025, 08:06:00 GMT

Flags flutter as soldiers participate in a military parade to mark the 80th anniversary of the end of World War II, in Beijing in September. The maiden flight of the unmanned Jiutian drone mothership has highlighted its advances in unmanned aerial vehicles capable of unleashing weaponized swarms. China conducted the maiden flight of what is considered to be the world's largest drone mothership, underscoring its advances in unmanned aerial vehicles capable of unleashing weaponized swarms. The unmanned Jiutian completed its first mission in the northwestern province of Shaanxi, the official Xinhua News Agency reported Thursday without elaborating. The aerial vehicle has been likened to an aircraft carrier for its ability to host multiple drones and missiles.

artificial intelligence, launch massive aerial drone carrier, social media, (11 more...)

The Japan Times

Country:

Asia > China > Beijing > Beijing (0.27)
Asia > China > Shaanxi Province (0.25)
Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.08)
(7 more...)

Industry:

Aerospace & Defense > Aircraft (1.00)
Government > Military > Navy (0.36)

Technology:

Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles > Drones (0.93)
Information Technology > Communications > Social Media (0.79)

Add feedback

The Oracle and The Prism: A Decoupled and Efficient Framework for Generative Recommendation Explanation

Zhang, Jiaheng, Zhang, Daqiang

arXiv.org Artificial IntelligenceDec-9-2025

The integration of Large Language Models (LLMs) into explainable recommendation systems often leads to a performance-efficiency trade-off in end-to-end architectures, where joint optimization of ranking and explanation can result in suboptimal compromises. To resolve this, we propose Prism, a novel decoupled framework that rigorously separates the recommendation process into a dedicated ranking stage and an explanation generation stage. This decomposition ensures that each component is optimized for its specific objective, eliminating inherent conflicts in coupled models. Inspired by knowledge distillation, Prism leverages a powerful, instruction-following teacher LLM (FLAN-T5-XXL) as an Oracle to produce high-fidelity explanatory knowledge. A compact, fine-tuned student model (BART-Base), the Prism, then specializes in synthesizing this knowledge into personalized explanations. Our extensive experiments on benchmark datasets reveal a key finding: the distillation process not only transfers knowledge but also acts as a noise filter. Our 140M-parameter Prism model significantly outperforms its 11B-parameter teacher in human evaluations of faithfulness and personalization, demonstrating an emergent ability to correct hallucinations present in the teacher's outputs. While achieving a 24x speedup and a 10x reduction in memory consumption, our analysis validates that decoupling, coupled with targeted distillation, provides an efficient and effective pathway to high-quality, and perhaps more importantly, trustworthy explainable recommendation.

explanation, large language model, machine learning, (20 more...)

arXiv.org Artificial Intelligence

2511.16543

Country: